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Abstract 

The use of genetically isolated populations can empower next-generation association studies. In this review, we dis- 
cuss the advantages of this approach and review study design and analytical considerations of genetic association stu- 
dies focusing on isolates. We cite successful examples of using population isolates in association studies and outline 
potential ways forward. 
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Genome-wide association studies (GWAS) have met 
with widespread success in identifying common- 
frequency variants associated with complex diseases 
and medically relevant quantitative traits. Technolo- 
gical advances in genotyping and sequencing have 
recently enabled access to low-frequency and rare 
variant genotypes at the population level. The iden- 
tification of modest effects at individual low fre- 
quency and rare variants requires very large sample 
sizes. Power can be boosted by using statistical 
approaches to aggregate rare variants across chromo- 
somal regions or functional units. Power can also be 
increased by leveraging the unique characteristics of 
isolated, or founder, populations in genetic associ- 
ation studies. In the past, population isolates have 
typically been used in family-based genetic studies 
of Mendelian traits [1 , 2] . Advances in genotyping 
and sequencing technologies have catalysed the use 
of founder populations in genetic association studies 
of complex traits. Here, we review the population 
genetics characteristics of isolated populations, out- 
line study design and analytical considerations and 
discuss examples of next-generation association stu- 
dies in population isolates. 



PROPERTIES OF POPULATION 
ISOLATES 

Population isolates can be defined as subpopulations 
derived from a small number of individuals who 
became isolated because of a founding event (e.g. 
settlement of a new territory, famine, war, environ- 
mental disruption, infectious disease epidemics, social 
and/ or cultural barriers) and have stayed so for many 
generations. The resulting geographical and/or 
cultural isolation of these populations has genetic 
consequences, as, for example, endogamy (within 
community marriage) along with very restricted 
gene flow (immigration) from neighbouring popula- 
tions can often be observed. Thus, genomes tend to 
show higher homogeneity in isolates compared with 
cosmopolitan populations, which is reflected by a 
reduced effective population size (N e or the effective 
number of individuals required to explain the 
observed genetic variability) [3] (Figure 1). 

Another potentially advantageous property of 
population isolates is environmental and cultural 
homogeneity. In addition to reduced genetic com- 
plexity, individuals from an isolated population tend 
to share a common lifestyle, including diet, physical 
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activity levels and other cultural habits, and, import- 
antly, are exposed to similar environmental and 
sanitary conditions and disease vectors. Phenotype 
definition and diagnosis harmonization can also be 
achieved through standardized clinician training, a 
model adopted, for example, by Finland. 

Isolates can branch and develop independently; 
geographical proximity and common origins do 
not imply identical evolution. Many isolates experi- 
ence more than one founding event, which can 
result in population bottlenecks [4] or the creation 
of regional subisolates [5]. A typical example is 
Finland, where both older (~2000 years) and 
younger (~500 years) population isolates have 
appeared within one geographical region due to in- 
ternal migration [6]. Various factors such as the total 
number of founders, number and intensity of bottle- 
necks, levels of endogamy, age and duration of iso- 
lation shape the demographic history of isolates 
(Figure 1). Detailed genealogical records are some- 
times available for isolated populations. For example, 
the Icelandic population has an extensive genealo- 
gical and disease history database [7, 8]. Given 
genome-wide data availability, the individual char- 
acteristics of isolates can be examined using popula- 
tion genetics analysis tools [9—12]. 



GENETIC CONSEQUENCES OF 
ISOLATION 

Reduced haplotype complexity 

Thanks to the HapMap [13] and 1000 Genomes 
Projects [14], we now have a good understanding 
of linkage disequilibrium (LD) patterns across the 
human genome for several human ethnic groups. 
In isolated populations, LD tends to extend over 
longer distances compared with non-isolated popu- 
lations, as exemplified by studies in the populations 
of the Central Valley of Costa Rica [15], Palau [16] 
and Val Borbera in northwest Italy [9]. As expected 
through ancestral recombination, the LD intervals of 
older isolates tend to be shorter than those of 
younger isolates [17]. Moreover, relatively higher 
levels of LD are observed in isolates with a small 
number of founders that experience slow growth 
during the early generations following the initial 
bottleneck [18]. Longer stretches of LD in isolates 
mean longer haplotypes, thus facilitating disease as- 
sociation studies and empowering imputation 
approaches that infer genotypes at untyped variants 
based on regional LD information [19]. The 
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Figure I: Factors that shape population isolate 
characteristics. 

disadvantage of high levels of correlation among 
sequence variants is a reduction in resolution of the 
localization of causal variants within a wide associ- 
ation peak signal. Trans-ethnic meta-analysis has 
been proposed as an approach to synthesize data 
across populations with diverse LD patterns to 
enable fine-mapping of the causal variant (s) [20]. 

Reduced allelic variability and 
genetic drift 

Isolation can influence the patterns and prevalence of 
disease. Notably, owing to the enrichment of some 
rare alleles resulting from the combined effect of en- 
dogamy, bottlenecks, genetic drift, recurrent muta- 
tion and selection, isolates have been shown to 
potentially exhibit an increased incidence of recessive 
disorders. Each isolate shows a unique profile of rare 
disease alleles [21], which can be expressed through a 
higher prevalence of some diseases and lower inci- 
dence of others [22, 23]. For example, the Pima 
Indians of Arizona have a very high prevalence of 
type 2 diabetes (~20%) [24, 25] and near absence of 
type 1 diabetes [26]. 

In population isolates, certain alleles reach fixation 
or extinction at a particular locus, thus reducing the 
extent of genetic variability [27, 28]. Some variants 
that contribute to complex traits/diseases are rare in 
the parent population and drifted to higher fre- 
quency in the isolate. The enrichment of low- 
frequency alleles in the study population can 
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empower the identification of these variants with 
smaller discovery sets. For example, a null mutation 
in APOC3 that had risen in frequency in the Amish 
founder population was found to be associated with a 
favourable plasma lipid profile. Association with this 
variant (previously thought to be private to the 
Amish) was recently replicated in an exome chip- 
wide association scan for lipid traits in ~1200 indi- 
viduals from a Greek population isolate [29]. To 
achieve 80% power to detect the observed effect 
size in the general European population (in which 
the variant has 40-fold lower frequency), a sample 
size of 67 000 individuals would have been required. 
The phenomenon of reduced allelic variability, com- 
bined with extended LD, is expected to improve 
power for trait association at rare variants compared 
with populations with wider allelic diversity (noting 
that other rare variants will be lost). 

STUDY DESIGN 

Population choice is an important consideration in 
designing a genetic association study focusing on iso- 
lates. Factors such as the number of founding haplo- 
types, age of divergence from the parent population, 
effective sample size and degree of admixture with 
neighbouring populations, all play a role in the 
population's allelic architecture. For initial GWAS 
as well as rare Mendelian gene discovery, the study 
of young founder populations with recent expansion 
(e.g. late-settlement Finland) is a powerful strategy 
[30] because of their higher degree of LD and 
reduced genetic diversity. It has been suggested 
that small populations that have remained stable 
throughout most of their history could lead to 
more economical locus discovery efforts [31]. Drift 
of rare alleles occurs at random and for a small set of 
variant sites; therefore, the power of a genetic asso- 
ciation study in a population isolate will depend on 
the enrichment of alleles that are relevant to a 
phenotype of interest [32—34], which requires the 
alleles in question to have passed through the initial 
population bottleneck. An association study in an 
isolate can often be motivated by a suspected 
higher prevalence of a trait or disease in that particu- 
lar population. 

TO TYPE OR TO SEQUENCE? 

GWAS arrays by definition assay primarily variants 
selected to represent common frequency variation 



across the genome. Low-frequency and rare variants 
genome-wide cannot be easily captured on a geno- 
typing array because of their large numbers and low 
levels of correlation. Recently, a genotyping chip 
focused on likely functional coding low-frequency 
and rare variants — the exome chip — has been used 
successfully in founder populations to associate rare 
variants with proinsulin [35] and HDL cholesterol 
[29] levels. The decreasing cost of sequencing 
makes it increasingly easier to study the complete 
variation landscape irrespective of allele frequency 
[36]. 

Whole exome sequencing has the advantage of 
reduced cost compared with whole-genome sequen- 
cing, but does not capture variation in non-genic 
regions. Previous experience from GWAS strongly 
indicates that the majority of complex trait signals 
reside outside of genes. High-depth sequencing is 
considered necessary to call high-quality variants 
across the allele frequency spectrum [37]. 
However, it has been shown that in the context of 
a population study, whole-genome sequencing many 
individuals at low depth can have variant detection 
power advantages over fewer individuals sequenced 
at higher depth [38, 39]. This approach was tested by 
the 1000 Genomes Project pilot and has been used 
to generate the widely used phase 1 and phase 2 
variant sets. 

When not all samples can be sequenced, whole- 
genome sequencing of a subset of cases and controls 
following an initial GWAS has proven to be a suc- 
cessful strategy for empowering rare variant associ- 
ation in dichotomous trait studies [40, 41]. Variants 
from the sequenced samples are phased using long- 
range haplotype phasing (LRP; see Imputation), then 
imputed back into the whole sample set, which is 
equivalent to using the sequenced subset as a refer- 
ence panel for imputation. 

ANALYTICAL CONSIDERATIONS 
Relatedness 

One intrinsic consequence of genetic isolation is re- 
latedness among individuals, which can conflict with 
the assumptions of independence of many com- 
monly used analysis tools and inflate test statistics 
affecting association signals. An efficient approach is 
to account for relatedness in the association analysis 
through the use of a linear mixed model (LMM). 
Until recently, computation of an exact association 
test statistic such as the Wald statistic or likelihood 
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ratio (implemented in EMMA [42]) was computa- 
tionally impractical. Tools that compute approxi- 
mate solutions, either by using the residuals from 
the LMM under the null model as phenotypes, 
such as GenABEL [43], or by avoiding the repeated 
estimation of variance components, such as 
TASSEL [44] or EMMAX [45], have recently been 
developed. Mathematically optimized versions of the 
exact test, such as GEMMA [46] or FaST-LMM 
[47], have also been developed and are widely used. 

Several methods have been proposed to improve 
on the power of single-point tests for rare variants by 
combining information across multiple sites in a 
chromosomal region and testing for association 
with the trait of interest [48—51]. Relatedness infor- 
mation can be incorporated in the model, such as in 
famSKAT [52] or other tools [53-57]. 

Imputation 

When performing association based on genotyping 
arrays, it is common practice to impute untyped vari- 
ants based on a reference panel (e.g. the 1000 
Genomes Project (www.1000genomes.org) and/or 
the UK10K study data (www.uklOk.org) to enhance 
the resolution of the study. This approach, where 
positions that were not genotyped in the sample 
are added using phase information in the reference 
set, is also relevant to refining genotype calls for low- 
depth sequencing data. 

Imputation is closely related to phasing, a proced- 
ure that infers haplotypes based on identity by state 
(IBS), with other phased individuals. Relatedness is 
helpful for phasing because it increases the likelihood 
of finding a long IBS string of variants; the more 
related the samples, the more certain it is that these 
IBS sequences are actually inherited identical- 
by-descent (IBD), and the probability that unobserved 
positions are IBS becomes quantifiable [58, 59]. Kong 
et al. [60] proposed LRP, a method that uses regions 
of IBD between related individuals within the sam- 
ple, to phase and impute variants. This approach has 
been generalized and improved to achieve higher ac- 
curacy around recombination sites in e.g. SLRP [61], 
ANCHAP [62] and other methods [63]. 

Meta-analysis 

The synthesis of data through meta-analysis can in- 
crease the power of association studies. Two different 
classes of methods have been typically applied in 
traditional meta-analysis of GWAS: P-value-based 
tests and effect size-based methods, which can be 



further subdivided into fixed or random effects 
models [64]. Fixed effects models assume that the 
same underlying effect is present in all studies, 
whereas random effects models allow for effect 
sizes to be different. These approaches can be applied 
to meta-analysing data across isolates. However, in 
the era of rare variant association testing, allelic het- 
erogeneity can decrease power either because of the 
presence of similarly associated multiple rare variants 
or different ethnic backgrounds in the populations 
being meta-analyzed [65]. In addition, meta-analysis 
generally assumes independence of the study sam- 
ples, which does not hold in the case of within- 
isolate meta-analysis. Research in this field is still 
ongoing [20], and a continued effort in method 
development is needed. 

COMPLEX TRAIT LOCUS 
DETECTION IN FOUNDER 
POPULATIONS 

Several studies from Iceland have been successful in 
identifying low- /rare-frequency variants associated 
with sick sinus syndrome, gout, prostate cancer and 
Alzheimer's disease [41, 66—68]. In a recent study in 
Finland, four novel loci were found to be associated 
with saccular intracranial aneurysms, a complex trait 
with a sporadic and a familial form [69]. One of these 
variants has drifted up 15 times in frequency com- 
pared with the Dutch general population and is vir- 
tually non-existent in other populations from the 
1000 Genomes Project. A genome-wide significant 
risk locus for schizophrenia and bipolar disorder has 
been identified in an ethnically homogeneous cohort 
of Ashkenazi Jewish individuals [70] . The top signal 
(rs 11098403) is an inter-genic variant located in 
NDST3 and was replicated in 11 independent 
cohorts of varying ethnicities. Recently, a Greek iso- 
lated population replicated a genome-wide signifi- 
cant association between R19X, a cardioprotective 
variant in APOC3, and low blood triglyceride levels 
[29]. This study also demonstrated that associations 
discovered in population isolates can be generaliz- 
able, as the same variant (R19X) had previously 
been discovered in the Amish founder population 
[71]. 

FUTURE DIRECTIONS 

Population isolates are uniquely positioned to usher 
in the new era of sequence-based association studies. 
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The relative power advantages afforded by studying 
isolates have been well-documented and recently 
exemplified in the literature through successful iden- 
tification of complex trait loci that replicate in other 
populations. Not all novel discoveries in isolates can 
be recapitulated in other populations. However, pri- 
vate variants detected in isolates can importantly 
point to novel biology, identifying potential path- 
ways and loci involved in the aetiopathogenesis of 
clinically relevant traits. Large-scale efforts to 
synthesize genome-wide data across isolated popula- 
tions are an intuitive next step in the field, and hold 
the promise of catalysing the discovery of further 
complex trait-associated variants. 



FUNDING 

This work is supported by the Wellcome Trust 
(098051) and the European Research Council 
(ERC-2011-StG 280559-SEPI). 



References 

1. Sheffield VC, Stone EM, Carmi R. Use of isolated inbred 
human populations for identification of disease genes. Trends 
Genet 1998;14:391-6. 

2. Puffenberger EG, Hosoda K, Washington SS, et al. A mis- 
sense mutation of the endothelin-B receptor gene in multi- 
genic Hirschsprung's disease. Cell 1994;79:1257-66. 

3. Charlesworth B. Fundamental concepts in genetics: effect- 
ive population size and patterns of molecular evolution and 
variation. Nat Rev Genet 2009;10:195-205. 

4. Peltonen L, Pekkarinen P, Aaltonen J. Messages from an 
isolate: lessons from the Finnish gene pool. Biol Chem Hoppe 
Seyler 1995;376:697-704. 

5. Jorde LB, Watkins WS, Kere J, et al. Gene mapping in 
isolated populations: new roles for old friends? Hum Hered 
2000;50:57-65. 

6. Peltonen L, Palotie A, Lange K. Use of population iso- 
lates for mapping complex traits. Nat Rev Genet 2000;1: 
182-90. 

7. deCODE genetics. http://www. decode. com (May 2004, 
date last accessed). 



8. Statistics Iceland, http://www.statice.is/ (May 2004, date 
last accessed). 

9. Colonna V, Pistis G, Bomba L, et al. Small effective popu- 
lation size and genetic homogeneity in the Val Borbera 
isolate. EurJHum Genet 2013;21:89-94. 

10. Esko T, Mezzavilla M, Nelis M, etal. Genetic characteriza- 
tion of northeastern Italian population isolates in the con- 
text of broader European genetic diversity. EurJHum Genet 
2013;21:659-65. 

11. Price AL, Helgason A, Palsson S, et al. The impact of 
divergence time on the nature of population structure: 
an example from Iceland. PLoS Genet 2009;5:el000505. 

12. Vitart V, Biloglav Z, Hayward C, etal. 3000 years of soli- 
tude: extreme differentiation in the island isolates of 
Dalmatia, Croatia. EnrJ Hum Genet 2006;14:478-7. 

13. International HapMap C, Altshuler DM, Gibbs RA, etal. 
Integrating common and rare genetic variation in diverse 
human populations. Nature 2010;467:52-8. 

14. 1000 Genomes Project Consortium, Abecasis GR, Auton 
A, etal. An integrated map of genetic variation from 1,092 
human genomes. Nature 2012;491:56-65. 

15. Service SK, Ophoff RA, Freimer NB. The genome-wide 
distribution of background linkage disequilibrium in a 
population isolate. Hum Mol Genet 2001;10:545-51. 

16. Devlin B, Roeder K, Otto C, etal. Genome-wide distribu- 
tion of linkage disequilibrium in the population of Palau 
and its implications for gene flow in Remote Oceania. 
Hum Genet 2001;108:521-8. 

17. Varilo T, Peltonen L. Isolates and their potential use in 
complex gene mapping efforts. Curr Opin Genet Dev 2004; 
14:316-23. 

18. Kruglyak L. Prospects for whole-genome linkage disequi- 
librium mapping of common disease genes. Nat Genet 1999; 
22:139-44. 

19. Marchini J, Howie B, Myers S, et al. A new multipoint 
method for genome-wide association studies by imputation 
of genotypes. Nat Genet 2007;39:906-13. 

20. Morris AP. Transethnic meta-analysis of genomewide asso- 
ciation studies. Genet Epidemiol 2011;35:809-22. 

21. Arcos-Burgos M, Muenke M. Genetics of population iso- 
lates. Clin Genet 2002;61:233-47. 

22. Charrow J. Ashkenazi Jewish genetic disorders. Earn Cancer 
2004;3:201-6. 

23. Norio R. Finnish Disease Heritage I: characteristics, causes, 
background. Hum Genet 2003;112:441-56. 

24. Knowler WC, Bennett PH, Hamman RF, et al. Diabetes 
incidence and prevalence in Pima Indians: a 19-fold greater 
incidence than in Rochester, Minnesota. Am J Epidemiol 
1978;108:497-505. 

25. Baier LJ, Hanson RL. Genetic studies of the etiology of type 
2 diabetes in Pima Indians: hunting for pieces to a compli- 
cated puzzle. Diabetes 2004;53:1181-6. 

26. Dabelea D, Hanson RL, Bennett PH, etal. Increasing preva- 
lence of Type II diabetes in Anerican Indian children. 
Diabetologia 1998;41:904-10. 

27. Otto SP, Whitlock MC. The probability of fixation in 
populations of changing size. Genetics 1997;146:723-33. 

28. Andrews CA. Natural Selection, Genetic Drift and Gene 
Flow do Not Act in Isolation in Natural Populations. Nature 
Education Knowledge 2010;3:5. 



Key points 

• The population genetics characteristics of an isolate depend on 
demographic history and the number of bottlenecks/founding 
events, total number of founders, levels of endogamy, age and 
duration of isolation. 

• Genetic consequences of isolation such as reduced haplotype di- 
versity and genetic drift can enhance the power for locus identi- 
fication in genetic association studies of complex traits. 



376 



Hatzikotoulas et al. 



29. Tachmazidou I, Dedoussis G, Southam L, etal. A rare func- 
tional cardioprotective APOC3 variant has risen in fre- 
quency in distinct population isolates. Nat Commun 2013;4: 
2872. 

30. Kristiansson K, Naukkarinen J, Peltonen L. Isolated popu- 
lations and complex disease gene identification. Genome Biol 
2008;9:109. 

31. Terwilliger JD, Zollner S, Laan M, et al. Mapping 
genes through the use of linkage disequilibrium 
generated by genetic drift: 'drift mapping' in small popula- 
tions with no demographic expansion. Hum Hered 1998;48: 
138-54. 

32. Wright AF, Carothers AD, Pirastu M. Population choice in 
mapping genes for complex diseases. Nat Genet 1999;23: 
397-404. 

33. Stephens JC, Bamshad M. Population choice as a consider- 
ation for genetic analysis study design. Cold Spring Harb Protoc 
2011;2011:917-22. 

34. Zuk O, Schaffner SF, Samocha K, et al. Searching for miss- 
ing heritability: designing rare variant association studies. 
Proc Natl Acad Sci USA 2014;lll:E455-64. 

35. Huyghe JR, Jackson AU, Fogarty MP, etal. Exome array 
analysis identifies new loci and low-frequency variants influ- 
encing insulin processing and secretion. Nat Genet 2013;45: 
197-201. 

36. Kiezun A, Garimella K, Do R, et al. Exome sequencing 
and the genetic basis of complex traits. Nat Genet 2012;44: 
623-30. 

37. Do R, Kathiresan S, Abecasis GR. Exome sequencing and 
complex disease: practical aspects of rare variant association 
studies. Hum Mol Genet 2012;21:Rl-9. 

38. Le SQ, Durbin R. SNP detection and genotyping from 
low-coverage sequencing data on multiple diploid samples. 
GenomeR.es 2011;21:952-60. 

39. Li Y, Sidore C, Kang HM, etal. Low-coverage sequencing: 
implications for design of complex trait association studies. 
GenomeRcs 2011;21:940-51. 

40. Zeggini E. Next-generation association studies for complex 
traits. Nat Genet 2011;43:287-88. 

41. Holm H, Gudbjartsson DF, Sulem P, etal. A rare variant in 
MYH6 is associated with high risk of sick sinus syndrome. 
Nat Genet 2011;43:316-20. 

42. Kang HM, Zaitlen NA, Wade CM, etal. Efficient control of 
population structure in model organism association map- 
ping. Genetics 2008;178:1709-23. 

43. Aulchenko YS, Ripke S, Isaacs A, et al. GenABEL: an R 
library for genome-wide association analysis. Bioinformatics 
2007;23:1294-6. 

44. Zhang Z, Ersoz E, Lai CQ, et al. Mixed linear model ap- 
proach adapted for genome-wide association studies. Nat 
Genet 2010;42:355-60. 

45. Kang HM, Sul JH, Service SK, et al. Variance component 
model to account for sample structure in genome-wide as- 
sociation studies. Nat Genet 2010;42:348-54. 

46. Zhou X, Stephens M. Genome-wide efficient mixed- 
model analysis for association studies. Nat Genet 2012;44: 
821-4. 

47. Lippert C, Listgarten J, Liu Y, et al. FaST linear mixed 
models for genome-wide association studies. Nat Methods 
2011;8:833-5. 



48. Chen H, Hendricks AE, Cheng Y, et al. Comparison of 
statistical approaches to rare variant analysis for quantitative 
traits. BMCProc 2011;5(Suppl 9):S113. 

49. Asimit J, Zeggini E. Rare variant association analysis meth- 
ods for complex traits. AnnuRev Genet 2010;44:293-308. 

50. Bansal V, Libiger O, Torkamani A, et al. Statistical analysis 
strategies for association studies involving rare variants. Nat 
Rev Genet 2010;11:773-85. 

51. Basu S, Pan W. Comparison of statistical tests for disease 
association with rare variants. Genet Epidemiol 2011;35:606— 
19. 

52. Chen H, Meigs JB, Dupuis J. Sequence kernel association 
test for quantitative traits in family samples. Genet Epidemiol 
2013;37:196-204. 

53. Schaid DJ, McDonnell SK, Sinnwell JP, etal. Multiple gen- 
etic variant association testing by collapsing and kernel 
methods with pedigree or population structured data. 
Genet Epidemiol 2013;37:409-18. 

54. Wang X, Lee S, Zhu X, et al. GEE-based SNP set associ- 
ation test for continuous and discrete traits in family-based 
association studies. Genet Epidemiol 2013;37:778-86. 

55. Listgarten J, Lippert C, Kang EY, et al. A powerful and 
efficient set test for genetic markers that handles confoun- 
ders. Bioinformatics 2013;29:1526-33. 

56. Oualkacha K, Dastani Z, Li R, et al. Adjusted sequence 
kernel association test for rare variants controlling for cryptic 
and family relatedness. Genet Epidemiol 2013;37:366-76. 

57. Jiang D, McPeek MS. Robust rare variant association testing 
for quantitative traits in samples with related individuals. 
Genet Epidemiol 2014;38:10-20. 

58. Powell JE, Visscher PM, Goddard ME. Reconciling the 
analysis of IBD and IBS in complex trait studies. Nat Rev 
Genet 2010;11:800-805. 

59. Carmi S, Palamara PF, Vacic V, etal. The variance of iden- 
tity-by-descent sharing in the Wright-Fisher model. 
Genetics 2013;193:911-28. 

60. Kong A, Masson G, Frigge ML, etal. Detection of sharing 
by descent, long-range phasing and haplotype imputation. 
Nat Genet 2008;40:1068-75. 

61. Palin K, Campbell H, Wright AF, etal. Identity-by-descent- 
based phasing and imputation in founder populations using 
graphical models. Genet Epidemiol 2011;35:853-60. 

62. Glodzik D, Navarro P, Vitart V, et al. Inference of identity 
by descent in population isolates and optimal sequencing 
studies. EurJHum Genet 2013;21:1140-5. 

63. Uricchio LH, ChongJX, Ross KD, etal. Accurate imput- 
ation of rare and common variants in a founder population 
from a small number of sequenced individuals. Genet 
Epidemiol 2012;36:312-9. 

64. Zeggini E, Ioannidis JP. Meta-analysis in genome-wide as- 
sociation studies. Pharmacogenomics 2009;10:191-201. 

65. Asimit J, Day- Williams A, Zgaga L, etal. An evaluation of 
different meta-analysis approaches in the presence of allelic 
heterogeneity. EurJ Hum Genet 2012;20:709-12. 

66. Sulem P, Gudbjartsson DF, Walters GB, etal. Identification 
of low-frequency variants associated with gout and serum 
uric acid levels. Nat Genet 2011;43:1127-30. 

67. Jonsson T, Atwal JK, Steinberg S, etal. A mutation in APP 
protects against Alzheimer's disease and age-related cogni- 
tive decline. Nature 2012;488:96-9. 



Using population isolates in genetic association studies 



377 



68. Gudmundsson J, Sulem P, Gudbjartsson DF, etal. A study based 
on whole-genome sequencing yields a rare variant at 8q24 
associated with prostate cancer. Nat Genet 2012;44:1326-9. 

69. Kurki MI, Gaal EI, Kettunen J, etal. High risk population 
isolate reveals low frequency variants predisposing to intra- 
cranial aneurysms. PLoS Genet 2014;10:el004134. 



70. Lencz T, Guha S, Liu C, et al. Genome-wide association 
study implicates NDST3 in schizophrenia and bipolar dis- 
order. Nat Commun 2013;4:2739. 

71. Pollin TI, Damcott CM, Shen H, etal. A null mutation in 
human APOC3 confers a favorable plasma lipid profile and 
apparent cardioprotection. Science 2008;322:1702-5. 



